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Abstract: We present a contrast-maximizing optimal linear representation of polarimetric images ob¬ 
tained from a snapshot polarimetric camera for enhanced vision of a polarized light source in obscured 
weather conditions (fog, haze, cloud) over long distances (above 1 km). We quantitatively compare the 
gain in contrast obtained by different linear representations of the experimental polarimetric images 
taken during rapidly varying foggy conditions. It is shown that the adaptive image representation that 
depends on the correlation in background noise fluctuations in the two polarimetric images provides 
an optimal contrast enhancement over all weather conditions as opposed to a simple difference image 
which underperforms during low visibility conditions. Finally, we derive the analytic expression of the 
gain in contrast obtained with this optimal representation and show that the experimental results are 
in agreement with the assumed correlated Gaussian noise model. 

Index Terms: Polarimetric imaging, Imaging through fog. 


1. Introduction 

Polarimetric imaging produces multi-dimensional pixel data that is either interpreted in terms 
of polarimetric properties of the imaged objects, or quite often, processed into a single image 
revealing specific contrasts which may not appear on standard reflectance images. In simplified 
polarimetric imaging systems, a pair of monochromatic images are acquired along two orthogonal 
polarization directions. The two-dimensional pixel data obtained can then be presented by either 
color encoding or by a combination of the two components so as to enhance contrast between 
objects in a scene sharing different polarimetric properties. Such contrast-maximizing represen¬ 
tation of the polarimetric information can prove helpful in underwater polarimetric imaging |T| and 
imaging through turbid media (2] (like colloids 0, tissues (4] and fog (5]). In most laboratory based 
imaging experiments ei, m, the scene is static and the object of interest is usually embedded in 
a uniform background, thus allowing for processing over multiple frames acquired over a period of 
time. However, in real-world scenarios with fast moving scene and/or camera, it is often desirable to 
reach real-time imaging and processing. This requires the identification and use of computationally 
simple and optimal representations of the polarimetric images that are adapted to the experimental 
scenarios at hand. 

In this article, we address the specific issue of contrast enhancement of an intentionally polarized 
beacon of light (or semaphore), imaged at a long distance through obscured atmosphere with a 



Correlation coefficient p 

0.00 0.25 0.37 0.48 0.58 0.72 0.87 0.95 



Fig. 1: (a) The schematic shows the long range imaging setup.The polarimetric camera consists 
of the arrangement of lenses LI and L2 after a monochromatic filter F (at 612 nm, FWHM = 12 
nm). The image is partly masked by a slit (FM) and passed through a Wollaston prism (WP) to 
produce two images 7y and I± on a single camera (C) using lens L3. (b) A subset of the raw 
images from the polarimetric camera, showing the tower and the source with varying conditions 
of fog density and visibility. The 8 images are sorted in increasing order of background correlation 
(p) estimated over a small ROI surrounding the source pixel. The time stamp of acquisition is 
indicated below each image. 


polarization-sensitive camera. Such situation is of great interest for applications in transportation 
safety. In this context, we demonstrate, both analytically and experimentally, that an optimal 
processing of the polarimetric images allows such contrast maximization under all experimental 
conditions encountered. The optimal polarimetric representation derived differs from commonly 
used polarimetric contrasts, but remains computationally compatible with real-time processing at 
video rate, which is a stringent constraint in the applicative context considered here. 

This article is organized as follows: in Section |2l the experimental setup is described, as well as 
the statistical measure used to assess the contrast of the source in the image. Then, the numerical 
and theoretical derivation of an optimal polarimetric representation is reported in Section |3l The 
efficiency of this optimal representation to enhance contrast of a polarized source through fog is 
then discussed on experimental data in Section 01 allowing us to confirm theoretical predictions 
in real field conditions. Conclusion and perspectives of this article are finally given in Section El 

2. Long distance polarimetric imaging experiment through fog 

The long distance polarimetric imaging experiment described in the following has been set up 
in the vicinity of the campus of University of Rennes 1 to gather experimental data on real 
atmospheric conditions. The imaging experiment covers a kilometric distance which is the typical 
range of distance one aims at for transportation safety applications like air and sea transport. 
A thorough description of this experimental facility and of the snapshot polarimetric imager de¬ 
signed is reported in 0, along with a detailed depiction of the experiment control and calibration 
procedure. 

2.1. Experimental setup 

The experiment basically consists of a highly linearly polarized source of incoherent light, placed 
on a telecommunication tower (about 80 m in height and 1.3 km far from the detection site at 
the laboratory), and a Wollaston-prism based polarimetric camera for imaging. The experimental 
setup reported in ||5) is improved here using a higher dynamics, low noise camera (Andor NEO 
sCMOS 5.5 Mpixels, 16-bit, 0.015 e-/pixel/s dark noise at -30°C sensor temperature) which is more 












































suitable for this experiment. Such high-dynamics detector enables finer sampling of intensity levels 
and noise statistics in the acquired images. The entire imaging system has been thoroughly re¬ 
calibrated with this new camera. As mentioned before and shown in the schematic in Fig.Qal the 
source is imaged from a distance of about 1.3 km, and the snapshot polarimetric imager enables 
the simultaneous acquisition of two images on the camera (namely, Jy and I±) corresponding to 
the two orthogonal polarization directions, with iy aligned with the direction of polarization of the 
source. For the purpose of illustration in this article, we pick a dataset of images taken during an 
experiment conducted on 24-01-2014 between 1:00 p.m. and 2:20 p.m. (at a time interval of 1 
frame/10 seconds). During this 80 minutes acquisition, successive passing fog layers obscured 
the source intermittently, causing the visibility of the source to evolve rapidly and significantly. An 
example of 8 frames acquired by the polarimetric camera can be observed in Fia.fTbl 

2.2. Polarimetric contrast image 

Using the image registration method described in 0, the two images can be extracted to form a 
set of two-dimensional pixels, such that the i th pixel Xf = [x|| ;i , xj_^) T is a part of the polarimetric 
image I P = {Xf } ie [i,jv] = [I\\,I±] T . In practice, this two-dimensional data is processed to provide 
the end-user with a final contrast image, or to feed a higher-level image processing algorithm 
(detection and tracking, segmentation, etc.). For that purpose, the recorded two-dimensional data 
can be represented by a linear combination of both images iy and Ij_, i.e., as a projection of 
the individual vectors Xf over a row vector W = [u, t>]. Thus, a linear representation denoted 
generically by 7 can be written as 


'y = WIp = uI\\+vIj_. 


( 1 ) 


In each such representation, the source will have different contrast and with different overall 
scaling depending on the values of u and v. As a result, we resort to a contrast-to-noise ratio 
(CNR) in order to fairly compare the contrast of the source in each representation. Using a local 
region of interest (ROI) of size 21 x 21 pixels around the source, we identify two sets of pixels 
(shown as colored squares in Fig. [2a] and described in the caption) denoted by B (background) 
and S (source), and we define the CNR of the source in a general representation 7 as 
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where, 
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respectively stand for the empirical mean and variance over region x, with cardinality N x . This 
contrast measure returns the local contrast of a central pixel w.r.t. its immediate background 
and remains invariant under scaling of the gray levels in the image, i.e., for different values of 
[u,u]. In a general case, the intervening medium may be birefringent and thus the values of 
u and v can range between [-1,1]. However, for non-birefringent medium, where no rotation of 
polarization is observed, the weight (u) of Jy remains non-zero and thus can be scaled out so 
that the representation depends only on the weight (v) of I±. Further, for a generic representation 
7 , we define a gain in contrast with respect to an intensity-summed image ( 7 s = I\\ +I±) which 
would be acquired with a standard camera. For brevity, we denote this gain as 
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3. Derivation of an optimal polarimetric representation 

Several combinations of the acquired polarimetric images can be envisaged for producing a final 
contrast image. In the context of this article, we aim at maximizing the CNR of a polarized light 







source over a background. This naturally raises the question of finding the optimal representation 
that provides the best contrast independent of the atmospheric situation, while remaining compu¬ 
tationally efficient to match real-time requirements. Before deriving such optimal representation, 
let us first recall standard polarimetric representations which are commonly used in the literature 
for various applications in polarimetric imaging. 

3.1. Standard polarimetric representations 

From the two acquired polarimetric images I\\ and I±, most simple and standard representations 
are : 


Intensity-summed image fa = [ 1 , 1 ] I P ): 

Such combination qualitatively provides the image that would be acquired with a standard, polarization- 
insensitive camera. Thus, other representations can be compared as a gain with respect to the 
intensity-summed image. 

Polarization filtered image (y\\ = [ 1 , 0] I P ): 

Another very simple approach is to use a polarization-filtered image, which can be obtained on 
our setup by retaining only the polarimetric image corresponding to the direction of polarization 
of the light source. 

Polarization-difference image (y A = [1, -1] I P ): 

Computing a difference image by subtracting the two polarimetric frames acquired is a very 
standard technique, widely used in polarimetric imaging [81 [ 12 ] for its efficiency in contrast 
enhancement. In the first experiments conducted with the imaging system described above, it 
was indeed noticed that the difference image performs generally better than the other standard 
representations (HJ. Nevertheless, it was also observed that 7 a does not always provide the best 
possible contrast in the context of polarimetric imaging through fog considered in this article. 

Orthogonal States Contrast image (OSC = 7a/te1‘ 

This polarimetric contrast is obtained by normalizing the difference image by the intensity-summed 
image. It is widely used and has proved efficient in active polarimetric imaging for its ability to 
provide an estimate of the degree of polarization of light scattered by an object (or an imaged 
scene) m, m However, it was shown that such normalization tends to increase the noise in 
the final image due to possible low intensity values in the intensity-summed image 0. For that 
reason, and due to the fact that the OSC cannot be written as a linear combination of I\\ and I±, 
this representation will not be considered in the remainder of this article. 

In the next subsections, we derive an optimal polarimetric representation, which in general 
differs from the most classical ones, whose expressions are summarized in Table [B 

3.2. Computational representation: Numerical maximization of CNR 

It is possible to determine the optimal projection numerically for each acquired frame by a sim¬ 
ple grid search over possible coefficient vectors W. As a result, the optimal representation, 
denoted Wes = [ucs, v C s ] in the following, is obtained by computationally solving Wes = 

argmax{C(WTZp)}, for u e [ 0 , 1 ] and v e [- 1 , 1 ] and obtaining an image yes = Wcslp- 

w _ 

On the resulting processed images of two individual frames in Fig.EH on the ROI defined above, 
it can be checked that a significant contrast enhancement can be obtained over the difference 
image y A and the intensity-summed image y s , which indicates that the optimal representation 
yes differs, in these cases, from both y A and y s . It can be observed that for the frame labeled 
[a], the optimal representation is very close to a polarization-filtered image 7 y. On the other hand, 
for the frame [b], the computational search leads to optimal weight of ves corresponding to an 
intermediate situation between representations y A and y\\. 



Fig. 2: (a) Comparison of the contrasts obtained for three representations of the polarimetric 
images for frames labeled as [a] and [b] in Fig. [Tbl The source region, S is bounded by the 3 x 
3 pixels red square and B is the background region between the two blue squares of sizes 11 x 
11 pixels and 21 x 21 pixels, (b) CNR-maximizing ID search over values of v keeping u = 1 for 
the two frames. 


3.3. Optimal representation: Theoretical maximization of CNR 

As observed in the previous subsection, the optimum linear combination of the polarimetric images 
for contrast enhancement may differ from the commonly used polarimetric representations, and 
may vary from frame to frame. Accordingly, identifying the physical parameter that influences 
the weights of the optimum linear combination would make it possible to implement an adaptive 
representation of polarimetric image that provides the best contrast for any weather condition. 
However, in such a long distance imaging setup, there is no a priori knowledge of the properties 
of the intervening medium, and thus we rely on the noise properties of the image. We hypothesize 
a correlated Gaussian noise model treating each pixel Xf as a bivariate random variable having 
a mean of ( X p ) s = [s + 6/2, b/2] T at the source location and ( X P ) B = [b/2,b/2] T outside the 
source location. Here, s and b denote the mean intensities of the highly polarized source and the 
depolarized background, respectively. The second-order statistical properties of X p are modeled 
by the covariance matrix 1 / defined as 

r* = ({Xf - {Xf)) {xf - {xf)) T ) = | 0 0 , (4) 

where e* stands for the standard deviation of the overall multiplicative optical noise, which is likely 
to be partially correlated in polarimetric channels especially in snapshot imaging. The correlation 
coefficient is denoted by p. With such statistical noise model, the theoretical expression of the 
CNR of a generic representation 7 has been derived in Appendix. The obtained expression can be 
easily and analytically maximized w.r.t. p, which indeed leads to the following linear representation 
7 ml = WmlIp with W M l = [1 ,-p]. This provides a simple adaptive representation where the 
background noise correlation coefficient, p, which can be in practice estimated locally over the 
region B using the following empirical estimator 

&b(X\\)o^{X p ) 

where B still denotes the background region comprised between the two blue squares of sizes 
11 x 11 pixels and 21 x 21 pixels depicted in Fig.[2al This representation is denoted as 7 ml (for 
maximum likelihood) since, for the experimental conditions at hand (with highly polarized source 
and completely depolarized background), its form could be equivalently derived from a likelihood 
maximizing approach (9j. Using this representation on the same dataset, it can be seen in Fia.l2ala 
that 7 cs and 7 ml consistently provide enhanced contrast compared to other simple polarimetric 











TABLE I: Polarimetric representations and gains in CNR. 


Representation 

Symbol 

W 

9 2 = C(7)/C(7e) 

Intensity-summed 

7s 

it i] 

1 

Pol. filtered 

7|| 

[1,0] 

a/ 2(1 + p) 

Pol. difference 

7A 

[1, -1] 

V(i + p)/(i-p) 

Computational 

7 CS 

[ 1 , vcs] 

- 

Max. Likelihood 

1ML 

[1, -p] 

\/2/(l - P) 


representations. As can be observed in Fig. |2alb, these representations are almost equivalent, 
the small discrepancy between each other being due to numerical errors in the computation of 
the estimators and/or to possible deviation of the actual statistics from a Gaussian model. This 
result is verified over the entire dataset and further discussed in the next section. 

4. Results and Discussion 

4.1. Experimental results 

In Fig.|3l we plot the time evolution of the gain in CNR defined in Eq.© for each representation. 
For reference, the CNR C(te) is shown as black-dotted line in the bottom of Fig.|3l The comparison 
confirms that the difference image 7 a (solid red lines) is not always the best representation and 
in many cases is outperformed by a simple ‘polarization-filtered’ image 7 ||, i.e., the raw J|| image 
(solid, filled green). Furthermore, 7 cs (solid black), presents the best contrast gain, and in general 
differs from both 7 A and 7 ||. It can be clearly observed that the gain in source contrast in the 
ML representation ( 7 ml) closely follows the best possible gain obtained with 7 C s, i.e., with a 
computational search over all possible linear combinations. As a result, this simple analytical 
representation behaves adaptively to present the best source contrast in the final image for all fog 
density conditions. These experimental results also quantify the advantage in using a polarimetric 
camera for long distance contrast-enhancement of a polarized beacon through fog, as the CNR 
gain rises from 2 -fold to a maximum of 12 -fold compared to an intensity-summed image which 
is qualitatively similar to an image obtained from a standard intensity camera. It must be noticed 
at this level that the noise statistics of an intensity-summed image may differ in general from 
the ones obtained with a true intensity imager. As shown in Appendix, a fair comparison with a 
true intensity imager would imply a gain comprised between 2 and 6 -fold, for the noise model 
considered. 


4.2. Influence of p and theoretical gains in CNR 

As stated in the previous section, the derivation of the ML representation allowed us to iden¬ 
tify the background correlation, p, as a crucial factor in determining the optimal contrast linear 
representation. In the above framework it is straightforward to calculate the theoretical CNR for 
each representation and thus compute the functional dependence of the gain in contrast with the 
correlation parameter (see Appendix). The theoretical forms are tabulated in Table Bland plotted 
as solid lines in Fig. |4al Furthermore, the experimentally generated contrast gains are plotted 
alongside (scattered symbols) as a function of locally estimated background correlation. The plot 
shows that the CNR gain for each representation depends on p in an orderly fashion which was 
not obvious in the ‘noisy’ time-series data in Fig. |3l It is interesting to notice that the difference 
image 7 A gives the best CNR with high values of p, but is outperformed by the polarization-filtered 
image 7 -y as the value of p falls bellow 0.5. The performance of 7 || remains linear with p, with 
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Fig. 3: Gain in contrast (log scale) reached by the difference image 7 a (red solid lines) and 7 cs 
representation (black solid line) w.r.t the intensity-summed image 7 s. The blue crosses show 
the gain obtained with 7 M l representation, which can be seen to follow the maximum attainable 
contrast. The green filled curve shows the gain of the 7 y image with respect to the intensity- 
summed image. 



Fig. 4: (a) The experimentally obtained gain in CNR (scattered points) for each representation is 
plotted in log scale along-side the corresponding functional forms (solid lines) of the gain listed 
in Table [D (b) Comparison of the intensity-summed image (left, [a]) with a processed CNR map 
of 7 ml (right, [b]) that can be provided as a final contrast enhanced image to the end-user or to 
a higher-level image processing unit. 


a maximum gain of 2 for high values of p, while that of 7 a rises steeply when p 1 . Again, 
the optimality of the ML representation is clearly seen, as it corresponds to the best contrast 
representation for all values of p. 

The p-dependant performance of the studied representations (particularly 7 A and 7 ml) can be 
interpreted by noting that, in the present context, p is a measure of the visibility of background 
structure in the local scene or local non-uniformity in reflectance. This can also be checked on 
the frames shown in Fig. QH that are sorted in increasing order of p. As a result, the main benefit 
of 7 a relies in its ability to suppress highly structured (and thus correlated) background in the 
final image. This property is retained by the 7 ml, which identifies with 7 A for p -» 1 . On the 
other hand, with uncorrelated background (p = 0 ), the perpendicular image I± does not bring 
any further information, making 7 | optimal during very low visibility conditions. The experiment 
also quantifies the advantage in using a polarimetric camera as we observe a maximum CNR 
gain of 12 -fold compared to the intensity-summed image which is qualitatively similar to an image 
obtained from standard intensity camera. 






















































4.3. Implementation of optimal representations 

Finally, we briefly discuss the implementation and some generic issues associated with computing 
such contrast-maximizing representations for each frame, when the locations and number of 
sources are unknown. In this case, each frame can be processed to obtain an intermediate 
contrast image that isolates the polarized source from the rest of the image as shown in Fia.l4bl 
To obtain the CNR image, the calculation of CNR over a sliding window can be replaced with 
a convolution approach which performs quickly by using Fast Fourier Transforms. For computing 
7 cs, multiple CNR images must be generated and the maximum value at each pixel must be 
chosen to form the final image. In contrast, for the ML representation one would need fewer 
Fourier transforms to calculate the local correlation coefficients, thus providing 7 ML a noticeable 
time advantage over 7 C s- However, with fast computers or FPGA based embedded system, both 
techniques should remain within real-time requirement. Another parameter of importance is the 
size and shape of ROI, which should also be varied to maximize the CNR over multiple scales, 
especially when the spread of the source is unknown. The ML method remains specifically suitable 
in this case as only a ID maximization over scale is required as opposed to a search over both 
scale and linear weighting of polarimetric images. 

5. Conclusion 

In this article, we first showed experimentally that when performing real field polarimetric imaging, 
two polarimetric channels acquired along two orthogonal polarization directions can have intensity 
fluctuations that are significantly correlated. Moreover, experiments reveal that the contrast of a 
polarized light source under any visibility condition can be maximized using a linear combination 
of the two acquired polarimetric images, which differs in general from the standard polarimet¬ 
ric representations used in literature. Linder a correlated Gaussian noise hypothesis, we also 
demonstrated that the optimal representation is simply related to the noise correlation coefficient, 
which is also observed experimentally. As a result, such computationally-efficient representation 
can replace a numerical search of the optimal weighting coefficients, and could thus be easily 
implemented in real-time applications as a pre-processing task for automated detection/localization 
on wide field images. Lastly, the results presented here could be easily generalized to any case 
of partially polarized source and background with finite detector noise, which could be of interest 
for underwater imaging or imaging in biological tissues. 


Appendix 

As noted above, the polarimetric pixel can be written as a 2-D random vector X p = [x^,x ± ] T , 
where the dependency in pixel location i is omitted in the appendix for the sake of clarity. Assuming 
a correlated Gaussian noise model and keeping the same notations as in Section l2lil we derive 
the expression for CNR for a generic polarimetric representation 7 . Its mean value at a given 
location in the image is directly given by ( 7 ) = W(X P ), and its variance reads <j 2 (j) = WTW T . 
From the definition of the CNR in Eq.{2}, a straightforward calculation yields 



As a result, the gain in CNR with respect to the intensity-summed representation 7 ^ reads 


s _ ^( 7 ) _ / 2u*(l +p) 


C( 7 s) y u 2 + 2puv + v 2 ’ 

from which the gain expressions of Table (Dare easily derived. 







If one now considers a true intensity imager, and given the notations above, the intensity level 
X 1 recorded at a given pixel would have a mean value of (X^s = s+b at the source location, and 
(X 7 ) b = b in the background region, with variance cr 2 pf J ) = e 2 . The CNR is thus C(X I ) = s/e, 


and as a result, 




C( 7 ) _ C( 7 ) c ( 7E ) 


C(X*) C ( 7E ) C(* j ) vT + 7' 


(A-6) 


This shows that the gain with respect to a true intensity imager is equivalent to the gain with 
respect to the intensity-summed image only when the correlation parameter p tends to 0, and is 
lower by a factor of 2 when p approaches unity. 
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